arXiv: 1505.0533lv2 [quant-ph] 3 Oct 2015 


Goerz et al. 


RESEARCH 


Hybrid Optimization Schemes for Quantum 
Control 

Michael H Goerz^, K. Birgitta Whaley^ and Christiane P Koch^* 


Correspondence: 
christiane.kochOuni-kassel.de 
^Theoretische Physik, Universitat 
Kassel, Heinrich-Plett-Str. 40, 
D-34132 Kassel, Germany 
Full list of author information is 
available at the end of the article 


Abstract 

Optimal control theory is a powerful tool for solving control problems in quantum 
mechanics, ranging from the control of chemical reactions to the implementation 
of gates in a quantum computer. Gradient-based optimization methods are able 
to find high fidelity controls, but require considerable numerical effort and often 
yield highly complex solutions. We propose here to employ a two-stage 
optimization scheme to significantly speed up convergence and achieve simpler 
controls. The control is initially parametrized using only a few free parameters, 
such that optimization in this pruned search space can be performed with a 
simplex method. The result, considered now simply as an arbitrary function on a 
time grid, is the starting point for further optimization with a gradient-based 
method that can quickly converge to high fidelities. We illustrate the success of 
this hybrid technique by optimizing a geometric phase gate for two 
superconducting transmon qubits coupled with a shared transmission line 
resonator, showing that a combination of Nelder-Mead simplex and Krotov’s 
method yields considerably better results than either one of the two methods 
alone. 

Keywords: quantum control; optimization methods; quantum information 


1 Introduction 

Coherent quantum control has long been a key component in the effort towards 
future quantum technologies. It relies on the interference between multiple path¬ 
ways to steer the quantum system in some desired way [1, 2]. Originally conceived 
for the control of chemical reactions [3, 4, 5], it has since been extended to a wide 
variety of applications, see Ref. [6] for a review. In this context, numerical opti¬ 
mal control theory (OCT) is a particularly powerful tool. OCT follows an itera¬ 
tive approach, improving the control field in each iteration to steer the dynamics 
to the optimization target. Generally, the fastest converging algorithms are those 
that take into account information about the gradient of the optimization func¬ 
tional with respect to variations in the control. Gradient-based methods assume 
an open-loop setup, where the entire optimization procedure is performed based 
on the knowledge of the system dynamics. Two widely used methods are Krotov’s 
method [7, 8, 9] and gradient ascent pulse engineering (GRAPE) [10]. Krotov’s 
method guarantees monotonic convergence for time-continuous control fields. The 
Limited memory Broyden-Fletcher-Goldfarb-Shannon (LBFGS) method [11] can be 
used to extend both GRAPE [12] and Krotov’s method [13], considering not only 
the gradient but also an estimate for the Hessian, i.e., the second order derivative. 
This has been demonstrated to improve the convergence in some instances, in par¬ 
ticular when close to the optimum [14]. In experimental setups that only allow for 
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limited control and knowledge of the dynamics, closed-loop control schemes have 
often been preferred [15]. There, the controls are updated based only on a measure¬ 
ment of the figure of merit, e.g. using genetic algorithms [16] or other gradient-free 
optimization methods. 

More recently, gradient-free optimization methods have also been employed in 
an open-loop context, prompted by the observation that evaluation of the gradient 
in many-body systems is often numerically infeasible. The chopped random basis 
(CRAB) method [17, 18] has been formulated for such applications. It expands 
the control in a relatively small number of randomly chosen spectral components 
and then applies a Nelder-Mead simplex optimization to the expansion coefficients. 
In principle, gradient-free methods are applicable if the control can reasonably be 
described by only a few free parameters and the optimization landscape has no local 
minima in the vicinity of the initial “guess”. 

Optimal control theory is particularly relevant for quantum information process¬ 
ing. Both Krotov’s method and GRAPE have been extensively used to obtain high- 
fidelity quantum gates [19, 20, 21, 22, 23, 24]. Short gate durations are crucial, in 
order to minimize detrimental effects of decoherence. With OCT, this is achieved by 
systematically decreasing the gate duration until no solution can be found [25, 26], 
thus operating at the quantum speed limit [27, 28]. Moreover, OCT may be used 
to actively minimize the effects of decoherence [29, 30], and to increase robustness 
with respect to classical noise [31]. Robustness is a requirement that is generally 
difficult to fulfill with analytical approaches. 

Here, we explore the possibility of combining gradient-free and gradient-based 
methods at different stages in the optimization, exploiting the benefits of each 
method. The application of a simplex optimization to a guess pulse, described 
by only a few free parameters, efficiently yields a comparably simple first opti¬ 
mized pulse of moderate fidelity. This pre-optimized pulse then provides a good 
starting point for further optimization using a gradient-based method. The second 
optimization stage relaxes the restrictions on the search space implied by a simple 
parametrization and may then quickly converge towards a high fidelity. The simplex 
pre-optimization addresses the observation that typically in direct gradient-based 
optimizations, due to the large size of the search space, the majority of the numeri¬ 
cal effort is spent in “getting off the ground”. Pre-optimization thus allows to locate 
a region of the search space in which the gradient is large enough to provide mean¬ 
ingful information. Such good guess pulses may sometimes be designed by hand, 
but this requires a very good intuition of the underlying control mechanisms. We 
propose here instead to simply prune the search space for the initial phase of the 
optimization by reducing the complexity of the control. The second optimization 
stage can then more easily identify high fidelity solutions. 

We illustrate the use of such hybrid optimization schemes by optimizing a quan¬ 
tum gate on superconducting qubits, using an example inspired by the recently 
proposed resonator-induced phase gate [32]. Superconducting circuits are a prime 
candidate for the implementation of quantum computing, due to the flexibility 
in qubit parameters, their inherent controllability, and the promise of scalability. 
Moreover, with recent advances in the transmon architecture, decoherence times 
are approaching 0.1 ms [33], allowing to reach fault-tolerance with sufficiently fast 
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gates. However, the flexibility and large number of different gate mechanisms [34] 
also imply a challenge from a control perspective, as it is not immediately obvious 
what are good qubit parameters, or good guess pulses. This makes superconducting 
circuits especially well-suited for combining a coarse search using simplex methods, 
which may then be refined with a more powerful gradient-based method. 

The paper is organized as follows: In section 2, we review the realization of a 
geometric phase gate on a system of two transmon qubits coupled with a shared 
transmission line resonator, using all microwave control. In section 3, we first in¬ 
troduce a functional targeting the geometric phase gate, i.e., any diagonal perfect 
entangler, and then show the results of a direct optimization using the gradient- 
based Krotov method. In section 4, we apply the gradient-free Nelder-Mead simplex 
optimization to obtain pre-optimized guess pulses, which then become the starting 
point for an optimization with Krotov’s method. We compare the control pulses ob¬ 
tained by the gradient-based, gradient-free, and hybrid schemes, and the dynamics 
they induce, as well as the numerical effort necessary to obtain converged results in 
each scheme. Section 5 concludes. 

2 Model and Gate Mechanism 

We consider a system of two transmon qubits, coupled via a shared transmission 
line resonator (“cavity”) [35, 36]. The Hamiltonian reads 

A=E 

< 7 = 1,2 

where cjc, ^2 are the frequency of the cavity and the first and second qubit, re¬ 
spectively; 0 ^ 1 , 0^2 are the qubit anharmonicities, and ^i, g 2 are the coupling between 
each qubit and the cavity. The operators a, bi, and 62 are the standard annihila¬ 
tion operators for the cavity (a) and the two qubits (bi, and 62 ), respectively. For 
numerical purposes, the Hilbert spaces for the qubit and cavity are truncated after 
6 , respectively 70, levels. It has been verified that the inclusion of additional levels 
yields no significant change in the results of the subsequent sections. The parameters 
take the values listed in Table 1. The system is driven by the microwave field e(t), 
with a pulse duration T. An off-resonant pulse results in a state-dependent shift of 
the resonator frequency [32]. For a slowly-varying pulse shape with e(0) = e(T) = 0, 
such that the level shifts occur adiabatically, the dynamics result in a geometric 
phase on each of the qubit levels. That is, the resulting gate takes the diagonal 
form 




CXq 


bgbgbqbq 


“I” “I" +Wca^a + e*(t)a + e(i)a^, (1) 


0 = diag . 


The maximal reachable concurrence of such a diagonal gate is 


C{i) = 



7 = 000 “ 010 ~ 001 + 011 5 


( 2 ) 

( 3 ) 


where 7 defines the non-local two-qubit phase [26] . The concurrence is obtained from 
the theory of local invariants for two-qubit gates [37, 38]. The local invariants for 
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a diagonal gate evaluate to Gi = 008 ^( 7 / 2 ) and G 2 = 1 + 2 008 ^( 7 / 2 ). From tlie 8 e, 


the Weyl chamber coordinate 8 may be calculated a 8 ci = 7/2, C2 =03 = 0 [38]. 
Following Ref. [39] , the concurrence i 8 evaluated a 8 a function of the Weyl chamber 
coordinate 8 to yield Eq. (3). The gate i 8 a perfect entangler for 7 = tt. 

We con 8 ider puEe 8 hape 8 of the form 



( 4 ) 


with a fixed driving frequency ujd given in Table 1. For 8implicity, we neglect the 
depha8ing induced by high cavity population8 which allow8 8olution of wave-packet 
dynamic8 with the time-dependent Schrodinger equation including microwave con¬ 
trol field8. For an arbitrarily cho8en gue88 puEe of duration T = 200 n8 and peak 
amplitude Eq = 300 MHz, the population dynamic8 re8ulting from the initial con¬ 
dition |T(t = 0)) = 100) are 8hown in paneE (a)-(d) of Fig. 1. 

If the pul8e induce8 adiabatic dynamic8, it 8hift8 the qubit and cavity leveE pro¬ 
portionally to e(t)^/A where A i8 the detuning from the re8pective level [32]. In the 
original field-free frame, thi8 i8 equivalent to 8hifting the initial wave packet propor¬ 
tionally to the 8quare of the puEe. Thu8, the excitation and population dynamic8 
8hould 8moothly follow the puEe 8hape. Specifically, the condition for adiabaticity 
i8 that if the puEe 8hape were to be 8tretched in time, the population dynamic8 
would 8imply 8tretch corre8pondingly. For a given peak amplitude, the larger the 
detuning of the drive ujd from the frequencie8 cjc, ^ 2 , ^ 1 , the 8maller the excitation 
in the re8pective Hilbert 8pace. Since the drive i8 detuned by only 40 MHz from 
the cavity, the cavity excitation, panel (a), reache8 a large value (n) « 30. The 
far-detuned right qubit, panel (b), and even farther detuned left qubit, panel (c), 
only 8how a 8mall excitation. A8 indicated by the 8tandard deviation8 8hown a8 
8haded area8 in paneE (a)-(c), the excitation remain8 relatively localized in energy. 
It i8 noteworthy, however, that the excitation curve8 (8pecifically of the cavity) 
8how 8ome imperfection8. The8e “wobble8” can be interpreted a8 deviation8 from 
the expected adiabatic dynamic8, e.g. jumping over an avoided cro88ing between 
highly excited cavity 8tate8. Thi8 ultimately re8ult8 in a 8mall I 088 of population 
from the logical 8ub8pace, a8 the 8y8tem doe8 not perfectly return to it8 original 
8tate. The I 088 of population for the given example i8 £pop = 5.9 x 10“^. While 8uch 
a 8mall deviation i8 not di8cernible in the plot of the population in panel (d), it i8 
nonethele88 a 8ignificant error in the objective of obtaining high fidelity gate8 below 
the quantum error correction limit, typically re8ulting in gate error8 below 10“^. In 
principle, cavity population can be 8uppre88ed by tuning the qubit parameter8 and 
U8ing non-trivial puEe 8hape8 [32]. 

The dynamic8 for the remaining two-qubit ba8i8 8tate8 are 8imilar to tho8e of the 
100) 8tate in Fig 1. The concurrence of the gate implemented by the propagated 
gue88 pul8e, according to Eq. (3), yield8 a value of G ^ 0.8. Thi8 implement8 the 
clo8e8t diagonal perfect entangler with an average gate error of 8.3 x 10“^. Both the 
I 088 of population from the logical 8ub8pace and the 8mall value of the generated 
entanglement imply that the cho8en puEe parameter8 are 8ub-optimal with re8pect 
to the de8ired geometric pha8e gate. We therefore turn to numerical optimal control 
to obtain a high fidelity gate. 


Goerz et al. 


Page 5 of 17 


3 Direct Optimization with Krotov's Method 

3.1 Optimization Functionals and Method 

The standard approach for implementing a specific quantum gate O using optimal 
control theory is to maximize the overlap between the time evolution U(T, 0;e(t)) 
under the control e(t) and the target gate [19]. For a two-qubit gate, this is com¬ 
monly expressed in the final time functional [40] 


Tsm 

J 'Y' 


1 - — 
16 


y](fc6^U(T,0;e(t)) 


k=l 


|fc) e{|oo),|oi),|io),|ii)}, (5) 


which goes to zero as the target gate O is implemented, up to a global phase. 

Krotov’s method allows to iteratively improve the control field, changing the con¬ 
trol to the updated control in the i’th iteration. The final time 

functional Jt^ given e.g. by Eq. (5), is augmented with a running cost to result in 
the total functional 

J[e«(i)] = Jr({dbr)})+ r ga[e^^\t)]dt, (t)) = (t))\k) . (6) 

Monotonic convergence is ensured by the choice [40, 9] 

9 a[e^^{t)] = ^(Ae(t))^ , Ae(t) ^ , (7) 

where is an arbitrary scaling parameter and S{t) G [0,1] is a shape function that 
ensures smooth switch-on and switch-off. 

The iterative update scheme is then given in terms of three coupled equations [9] , 


Ae(i) 



S{t) 


Im 


E Uibi) 


Lfc=i 


/m 








de 




^(i+l)(t) 


i r|t(») 




xfW), xt\T)) = 


( 0 / 


OJt 


d{(pk\ 




(8a) 

( 8 b) 

(8c) 


with \A(pk{t)) = — l'/’fc^(^))- The second order contribution to the update, 

with the prefactor cr(t), is required for certain types of functionals [9], as we will 
see below. For the choice of J|F, we may set a{t) = 0 [9]. 

For the gate mechanism outlined in Section 2, we obtain a phase on each of the four 
logical basis states. These phases should combine to produce a perfect entangler, 
with 7 = TT according to Eq. (3). The individual phases 0io and (pu depend 

delicately on the shape, amplitude, and duration of the pulse. It is therefore not 
known a priori for a given guess pulse which exact geometric phase gate will or can 
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be reached, and thus what should be the target gate O of the optimization. For a 
gate Uo induced by a guess pulse, we may construct the closest diagonal perfect 
entangler by numerically evaluating 

6= argmin ||6diag(</>oo, 001,010) - Ooll, (9) 

000 , 001,010 


with 


Odiag( 0 OO 5 001 : 0io) — diag 


o*0OO 


• 1 p*0io ph7^+0oi+0io —0oo) 

, c- , c- , 


( 10 ) 


which includes the condition 7 = tt to make the gate a perfect entangler. Using 
this gate as a target fully determines the optimization problem, with two caveats. 
First, the target gate depends on Uq induced by the (arbitrary) guess pulse, and 
second, the construction of the closest diagonal perfect entangler does not take into 
account the topology of the optimization landscape; the “closest” gate is by no 
means guaranteed to be the one that is easiest to reach. 

An approach that addresses these issues is to go beyond the standard functional 
of Eq. (5) and formulate a functional that targets the properties of the geometric 
phase gate specifically, without stipulating the phases on all of the logical states. 
We split the functional into two terms, 

Jr = ^Vdiag + Jd, (11) 

where Jdiag goes to zero if and only if the gate is diagonal (with arbitrary phases) 
and goes to zero if and only if the gate is also a perfect entangler, 7 = tt. Thus, 
the functional is conceptually similar to a recently proposed functional targeting 
an arbitrary perfect entangler [41]. However, the additional restriction to enforce a 
diagonal gate is important, as the Hamiltonian also allows for non-diagonal gates, 
but only through undesired non-adiabatic effects. 

The two terms take the form 


^diag — 4 tqo'Tqq tqiTq]^ HoUo 5 (12a) 

= 2 + Too To 1 Tio n 1 + Too To 1 Tio Ti 1 , (12t>) 


with 

TOO = (oo|0|oo) , Toi = (01|0|01), no = (10|0|10), m = (ii|0|ii). (i2c) 
The construction of Ty is based on the observation that 
7 = TT 2 + -h = 0 , 

which becomes Eq. (12b) by associating n with and using the definition of 7 
in Eq. (3). Both Jdiag and take values G [0,4], hence the normalization factor | 
in Eq. (11) to bring the value of the functional closer to that of J|U. 
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In contrast to the functional is not convex, since the states enter in 
higher than quadratic order. The Krotov update equation (8a) must then include 
the second order contribution, where a{t) can be determined numerically in each 
iteration as [9] 


cr(t) = —max(e^, 2^4 + e^), A = 


2 ELi Re [{xk{T)\AMT))] + AJt 


(13) 


with a small non-negative number ca, and AJt = ~ '^T{{(t>k\T)}). 

The boundary condition for the backward propagated states in Eq. (8c) yields 


Ixoo(T)) 

= (too - ToiTioTii) |00) , 

(14a) 

Ixoi(T)) 

= (toi - TooTioTii) |01) , 

(14b) 

IXio(T)) 

= (tiO - TooToim) |10) , 

(14c) 

\Xu{T)) 

= (tii -ToVoiTlo) |11). 

(14d) 


Both and are only loosely connected to the average gate fidelity that 
is accessible to experimental measurement. In the case of a two-qubit gate and 
non-dissipative dynamics this can be evaluated as [42] 


avg 


/ 




6^0 




2 1 
dT= — 
20 


tr 


6^0 


+ tr 


6 ^ 00^6 


(15) 


Thus, Favg, respectively the gate error ^avg = 1 — Tavg, provides a well-defined 
measure of the optimization success independent of the choice of optimization func¬ 
tional. For an optimization with we may evaluate 5avg with respect to the 

closest geometric phase gate resulting from propagation with the optimized pulse, 
according to Eq. (9). 


3.2 Optimization Results 

The optimization starts from the guess pulse described by Eq. (4), with T = 200 ns 
and Eq = 300 MHz, as discussed in Section 2, with the dynamics shown in panels 
(a)-(d) of Fig. 1. The gate error with respect to the closest geometric phase gate for 
this guess is ^avg = 8.3 x 10“^, with a loss of population from the logical subspace 
of ^pop = 5.9 X 10“^. The concurrence error, defined as = 1 — C", takes the value 
1.9 X 10-^ 

Optimization using Krotov’s method and as the optimization functional con¬ 
verges within 5516 iterations of the algorithm. Convergence is assumed when the rel¬ 
ative change of the functional AJt/Jt falls below 10“^, such that no significant fur¬ 
ther improvement is to be expected. The gate error is reduced to ^avg = 1.4 x 10“^. 
It is dominated by the remaining loss of population from the logical subspace, 
^pop ~ ^avg 5 as the concurrence error is only = 1-8 x 10“^, see Table 2. 

The resulting optimized pulse is shown in Fig. 2, with the guess pulse indicated 
by the dashed line. For the center 100 ns, there are only small deviations from the 
guess pulse (both in shape and phase). Significant deviations occur only at the very 
beginning and end, most notably the variation in the complex phase (center panel) 
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between 20 and 40 ns. The spectral width of the pulse (bottom panel) remains well 
within a bandwidth of ±50 MHz. The dynamics of the |00) state under the optimized 
pulse are shown in panels (e)-(h) of Fig. 1. The difference to the dynamics under 
the guess pulse, panels (a)-(d), is striking; the excitations no longer smoothly follow 
the pulse shape, but show strong oscillations on top of the expected behavior. The 
features at the beginning of the optimized pulse provide a kick to the system, 
inducing oscillations in the populations, with a counter-kick near the end of the 
pulse. These kicks are very visible in the population of the |00) state in panel (h), 
compared to the smooth dynamics for guess pulse in panel (d). 

It is worth noting that the gate obtained with the optimized pulse is not the closest 
geometric phase gate to the gate implemented by the guess pulse, as in Eq. (9). This 
illustrates the benefit of using over J|±. The latter optimizes towards a specific, 
pre-determined gate, according to Eq. (9), while can dynamically adjust which 
specific geometric phase gate is easiest to reach, allowing it to fulfill the objective 
much more easily. Table 2 shows that optimization with requires significantly 
less propagations (which are directly proportional to CPU time) than optimization 
with J|U, for both direct and pre-optimized strategies. 

While the optimization yields a gate error well below the quantum error correction 
threshold, it deviates significantly from the simple geometric phase gate scheme, re¬ 
sulting in complex dynamics. The numerical effort required to obtain a high fidelity 
solution is considerable, with several thousand iterations (each iteration requiring 
two full propagations of four logical basis states). We have discussed here only the 
optimization for a fixed gate duration of T = 200 ns. Generally, significantly faster 
quantum gates could potentially be implemented using other mechanisms, e.g. [24]. 
The geometric phase gate, however, relies on adiabatic shifts of the energy levels 
such that loss of population from the logical subspace inhibits realization of high 
fidelities when pushing the gate duration significantly below 200 ns. 

The complexity of the optimized pulse is typical for Krotov’s method or other 
gradient-based optimization methods. Eor the present example, this clashes with 
the gate mechanism of the geometric phase gate that intends to use simple and 
smooth pulse shapes. 


4 Hybrid Optimization Scheme 

4.1 Simplex Optimization 

The gate error of the guess pulse is dominated by the insufficient amount of entan¬ 
glement that is generated. A natural approach is to maintain the analytical pulse 
shape of Eq. (4) for the time being, and to vary the free parameters Eq and T 
in order to maximize the figure of merit. Such an optimization of a pulse deter¬ 
mined by only a handful of parameters (in this case two) is easily performed using a 
gradient-free method such as Nelder-Mead simplex. This has the additional benefit 
that there are no restrictions on the choice of optimization functional. Specifically, 
there is no need to formulate it in such a way that derivatives can be calculated 
analytically. Thus, we can include the objective of minimizing the gate duration in 
the functional and modify Eq. (11) to read 


j-splx 


Jd 


lag 




To = 200 ns. 


(16) 
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Note that with the addition of penalizing the gate duration, the functional is no 
longer a distance measure that approaches zero as the target is reached; the simplex 
optimization will find a local minimum of Eq. (16) at a value of < 1. 

Using only 116 propagations, the algorithm converges to a solution that reduces 
the gate duration from 200 to 185 ns, while bringing the entanglement error down to 
Sc = 2.0 X 10“^, see table 2. The resulting dynamics are shown in panels (a)-(d) of 
Fig. 3. They are similar to those of the original guess pulse, cf. panel (a)-(d) of Fig. 1. 
The shorter pulse duration and larger pulse intensity (from 300 MHz in the original 
guess to ^ 400 MHz) results in a significantly larger cavity excitation. It also leads 
to more non-adiabatic defects (wobbles in the cavity excitation). Consequently, the 
loss of population from the logical subspace is increased by about a factor of two 
to Spop = 1.4 X 10“^, and limits the total gate error to s^vg = 1.4 x 10“^. Thus, 
while the simplex search yields a dramatic improvement over the original guess 
pulse, it does not reach a sufficiently high fidelity to approach the quantum error 
correction limit. To remedy this, we turn to a hybrid approach, combining simplex 
and gradient-based optimization. 

4.2 Continued Optimization with Krotov’s Method 

For the present example, we use the final pulse of the previous section as the starting 
point of an optimization with Krotov’s method. Since now the guess pulse already 
has a relatively high fidelity, both the and functionals may be used inter¬ 
changeably, as we are only searching in a very small vicinity of the starting point. 
Both methods converge rapidly to AJt/Jt < 10“^ in under 200 iterations. The 
dynamics resulting from the propagation of the |00) state under the pulse obtained 
from optimization with the J|U functional is shown in panels (e)-(h) of Fig. 3. The 
comparison to the dynamics of the pre-optimized guess pulse in panels (a)-(d) is 
striking: the excitations now follow the pulse shape smoothly. The non-adiabatic 
defects, i.e., the wobbles especially in the cavity excitation in panel (a), have been 
corrected. Consequently, the loss of population from the logical subspace is now 
reduced to a value of Spop = 1.1 x 10“^. Together with only a slight increase in the 
concurrence error to = 5.1 x 10“^, the overall gate error of the optimized pulse is 
^avg = 3.4x10“^. This is an improvement of half an order of magnitude compared 
to the direct optimization in section 3.2. Moreover, the result has been obtained at 
a small fraction of the numerical cost. The pre-optimized guess and post-optimized 
pulse shape, indicated as the blue shaded area in panels (d) and (h), appear visually 
indistinguishable. 

The correction to the pre-optimized guess pulse is shown in Fig 4. Indeed, the 
corrections are on the order of 1 MHz, much smaller than the peak amplitude of 
Eq ^ 400 MHz. The corrections follow a regular pattern, both in the shape (top 
panel) and the complex phase (center panel), as indicated by the existence of sharp 
peaks in the spectrum (bottom panel). It appears that the non-adiabatic defects in 
the pre-optimized guess pulse can be corrected by a series of small kicks in amplitude 
and phase at regular intervals. This results in an optimized pulse that is concep¬ 
tually simpler and yields a higher fidelity than the direct application of Krotov’s 
method. The comparison illustrates the power of a hybrid approach to steer the 
physical characteristics of the optimized control. In our example, the parametriza- 
tion used in the first optimization stage enforces the desired pulse shape. Without 
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this restriction, pulses of undesirable complexity are obtained. On the other hand, 
the restriction must ultimately be relaxed to allow for the representation of the 
higher-frequency features that correct non-adiabatic defects. Thus, the role of the 
two-stage optimization is also to influence the physical features of the control, in 
addition to its numerical benefits. 

The striking numerical efficiency of the hybrid optimization scheme can be seen 
by comparing its convergence to that of the direct optimization. This is shown in 
Fig. 5. The direct optimization shows an extended plateau for at least the first 
100 iterations, before slowly converging. This behavior is typical for ill-chosen guess 
pulses [43]. For a direct optimization with the functional, the plateau extends for 
several thousand iterations, and does not yield convergence within 10000 iterations. 
In contrast, the optimization starting from a pre-optimized pulse has no plateau 
(note the log-scale of the x-axis); both and converge at roughly the same 
rate. The improvement by the simplex (pre-)optimization compared to the original 
guess pulse can be seen from the difference in the y-intercept between the “direct” 
and “pre-optimized” curves in Fig. 5; in addition, there is also a reduction in the 
gate duration from 200 to 185 ns at no additional numerical cost. 

The gate duration for the geometric phase gate mechanism is limited by the 
requirement of adiabaticity. We have also performed the hybrid optimization scheme 
for a gate duration of 100 ns. In this case, the simplex search yields significant non- 
adiabatic defects, with a loss of population of 1.2 x 10“^. The concurrence error is 
2.3 X 10“^ and the total gate error, e^vg = 1.2 x 10“^, is dominated by the population 
loss. Post-optimization using Krotov’s method significantly reduces the total gate 
error to s^vg = 1.4 x 10“^. The post-optimization result is obtained at a numerical 
cost very close to that for the T = 200 ns gate, and yields pulse corrections very 
similar to those shown in Fig. 4. This results in a correction of the non-adiabatic 
defects and thereby lowers the population loss to only 6.5 x 10“^. The total gate 
error is now dominated instead by the increased concurrence error of 1.7 x 10“^, 
which is insufficient for a high quality phase gate. The observation that an overall 
improvement in gate performance at T = 100 ns from hybrid optimization is only 
possible by increasing adiabaticity at the cost of reduced entanglement, indicates 
that a quantum speed limit has been reached for the specific gate mechanism. These 
results show that a hybrid optimization scheme may be used successfully even when 
operating close to the quantum speed limit. 

While it is not the aim of this work to characterize the quantum speed limit for the 
coupled transmon system, we note that this could be quantified by systematically 
scanning over the gate duration [26]. Under the constraints of the specific gate 
mechanism employed here, a numerical approach appears necessary to extract the 
speed limit. However, for the Strauch gate of Ref. [44], an analytic estimate may be 
made from summing the minimal precession periods required for each component 
of the gate, i.e., two iSWAP and one controlled-Z gates. With the qubit-cavity 
interaction strength of ^ = 70 MHz employed here, this would suggest that gate 
durations as short as a few tens of nanoseconds might be attainable for this system. 

We may also compare the total number of propagations necessary to obtain the 
optimized pulse, included in Table 2 together with measures of success: concurrence 
error, loss of population from the logical subspace, and gate error. Each itera¬ 
tion using Krotov’s method requires two propagations. For the hybrid optimization 
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schemes (“pre-opt s.m.”, “pre-opt hoL”), the number of propagations includes sim¬ 
plex as well as the additional propagations due to Krotov’s method. The gate error 
is measured with respect to the exact target gate when using the square-modulus 
functional, and with respect to the closest geometric phase gate to the optimized 
dynamics in all other cases. In terms of numerical efficiency, the hybrid schemes out¬ 
perform the direct optimization by nearly two orders of magnitude while resulting 
in a significantly better gate fidelity. 

5 Conclusions 

For the example of a geometric phase gate on a system of two transmon qubits 
with a shared transmission line resonator, we have considered the application of 
gradient-based optimization methods, specifically Krotov’s method, and a gradient- 
free optimization method, Nelder-Mead simplex. The objectives of the geometric 
phase gate can be formulated in a specialized optimization functional that 
reaches its optimal value for any diagonal perfect entangler. We have shown that 
for a direct optimization, this functional vastly outperforms the “standard” square- 
modulus functional for gate optimization. Convergence is aided by the use of a 
functional that formulates the objective as general as possible. This is in agreement 
with recent results for optimization using a functional targeting arbitrary perfect 
entanglers [41, 45]. 

The direct optimization using Krotov’s method can in principle find controls that 
implement the desired gate with high fidelity. However, the resulting dynamics 
are complex and the numerical effort is dominated by an extended plateau in the 
initial phase of the optimization; for short gate durations, optimization becomes 
increasingly harder and the fidelity is limited by loss from the logical subspace. 
The numerical effort to reach high fidelities quickly becomes unfeasible in this case. 
In contrast, parametrizing the pulse by its duration and peak amplitude only, and 
applying a simplex optimization on those parameters, we are able to find a simple 
analytic pulse that implements the desired gate with moderate fidelity, still one 
order of magnitude above the quantum error correction limit. Thus, for the example 
presented here, neither the application of a gradient-based algorithm nor the simplex 
optimization alone yield satisfactory results; only the combination of both methods 
into a hybrid optimization scheme is able to obtain controls with a clear mechanism 
that implement a geometric phase gate to high fidelity, with a minimum of numerical 
effort. 

These results prompt the recommendation to generally adapt hybrid optimization 
schemes, i.e., obtain guess pulses for gradient-based optimization from a gradient- 
free pre-optimization, when there is insufficient knowledge to design good guess 
pulses by hand. There is great flexibility in the choice of parametrization. Here, we 
have taken the two free parameters, peak amplitude and gate duration, in a simple 
fixed analytical formula for the pulse shape. Generally, one might use slightly more 
sophisticated parametrizations, following e.g. the CRAB approach [18]. The small 
number of free parameters and relatively high quality of the original guess pulse with 
a fidelity of already > 90% results in particularly fast convergence of the simplex 
method. For a more sophisticated example, at least several hundred propagations 
would probably be required in the simplex stage. However, even in that case, this 
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numerical effort is by far outweighed by the large number of iterations required 
to leave the initial plateau in the optimization landscape for a bad guess pulse in 
a direct gradient-based optimization. Moreover, any figure of merit is suitable for 
optimization with the simplex method, as there is no need to derive the gradient for 
the optimization functional. For example, the gate duration can be included in the 
figure of merit, something that generally is not straightforward in gradient-based 
methods [46]. The hybrid scheme is aimed at providing optimal solutions in an 
open-loop context, and still leaves open the possibility of further combining open- 
loop and closed-loop optimization methods when targeting a specific experimental 
setup [47]. 

In principle, the approach could thus be extended to multiple stages, where in 
each stage, a different parameterization and a suitable optimization algorithm is 
used. There is no requirement for a specific method such as Nelder-Mead simplex 
or Krotov’s method to be employed. For example, a two-stage optimization using 
a genetic algorithm in the first stage, and a gradient algorithm in the second stage 
has been used for the optimization of quantum gates in strongly coupled two-level 
systems [48, 49]. There, starting with a simple parametrization, and then moving 
to a less restricted search space in the second optimization stage was also found to 
benefit the overall optimization performance, in agreement with the results shown 
here. 

Adding an additional stage might also be beneficial when the optimization land¬ 
scape is non-trivial and contains traps or saddle points. This may happen e.g. when 
the optimization is performed with limited resources [50]. Repeating the simplex 
search from a different starting point—either by systematic variation or by random 
search—may then find solutions that do not get stuck in traps of the optimization 
landscape. Of course, the truncation of the search space due to the low-dimensional 
parametrization may itself introduce additional traps in the landscape. However, 
these traps would disappear when returning to the full search space in the final 
optimization stage. 

For the specific example of the geometric phase gate considered here, we have 
found the post-optimization to introduce small corrections to non-adiabatic defects. 
Due to the small relative strength of the correction, the experimental realization of 
the geometric phase gate would require extraordinary precision. Moreover, the large 
excitation of the cavity would limit the fidelity when dissipative effects, specifically 
spontaneous decay of the cavity are taken into account. However, we stress that the 
method of employing hybrid optimization schemes presented here is entirely general 
and aimed at reducing the numerical effort in obtaining high fidelity solutions to ar¬ 
bitrary quantum control problems, not limited to quantum information processing. 
Moreover, robustness with respect to both fluctuations in the control parameters 
and dissipation can be achieved using complimentary advanced control techniques, 
such as a description in Liouville space and ensemble optimization [43, 31]. These 
approaches are numerically even more demanding, such that the reduction of the 
optimization cost achieved by the hybrid scheme discussed here may become im¬ 
perative. 
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Figure 1 Population dynamics under guess and optimized pulse. The figure shows the 
population dynamics of the initial state |^} (t = 0) = |00} for the guess pulse, panels (a)-(d), and 
the pulse obtained from direct optimization of the J|f° functional, panels (e)-(h). In panel (a), 
(e), expectation value (n) of the cavity excitation, plus-minus the standard deviation an- In panels 
(b), (f), and (c),(g), expectation values and standard deviations for the excitation of the right and 
left qubit, respectively. In panel (d), (h), population in the state |00}. The pulse shape 
(normalized by the peak amplitude Eq = 300 MHz, cf. Fig. 2) is shown in the background of 
panels (d), (h). The guess pulse implements a geometric phase gate with a gate error of 
£avg = 8.3 X 10"^, with concurrence error ec = 1-9 x 10"^ and population loss from the logical 
subspace £pop = 5.9 x 10'^. The optimized pulse decreases the gate error to £avg = 1-4 x 10"^, 
with ec = 1-8 X 10'^ and Spop = 1.4 x 10"^, see Table 2. 
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Figure 3 Population Dynamics under (pre-optimized) guess and optimized pulse. The figure 
follows the conventions of Fig. 1. Panels (a)-(d) show the dynamics resulting from a simplex 
optimization, see text for details. The resulting pulse is the starting point for a continued 
optimization using Krotov’s method with the functional. The optimized dynamics are shown 
in panels (e)-(h). The pulse amplitude indicated in the background of panels (d), (h) is 
normalized to the peak amplitude of Eq ^ 400 MHz. The simplex-optimized pulse implements a 
geometric phase gate with a gate error of £avg = 1.4 x 10"^, with sc = 2.0 x 10"^ and 
£pop = 1-4 X 10'^. The continued optimization decreases the gate error to £avg = 3.4 x 10"^, with 
Sc = 5.1 X 10"^ and Spop = 1.1 x 10"^, see Table 2. 
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Figure 4 Pulse corrections obtained with Krotov's method. The figure summarizes the 
differences between the optimized pulse, cf. panel (h) in Fig. 3, and the (pre-optimized) guess 
pulse, cf. panel (d) in Fig. 3, also indicated by the dashed red/gray line in each panel. The panels 
from top to bottom show the corrections to absolute value, complex phase, and spectrum of the 
pulse, respectively (solid black lines). The amplitude, phase, and spectrum of the guess pulse 
(dashed red/gray lines) are shown using the alternative axis scaling in red/gray. 
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Figure 5 Convergence of optimization towards a geometric phase gate. Value of the final-time 
optimization functional J|f°, respectively , over the number of iterations using Krotov’s 
method. Each iteration requires two full propagations; the number of propagations are 
proportional to the required CPU time. The direct optimization starts from an arbitrary guess 
pulse, see section 3.2 for details. In the pre-optimized case, the guess pulse was the result of a 
simplex optimization, see section 4.2 for details. 



Table 1 Parameters for two transmon qubits coupled via a shared transmission line resonator 


cavity frequency 

(jOc 

8.10 

GHz 

left qubit frequency 

UJI 

6.85 

GHz 

right qubit frequency 

UJ2 

7.25 

GHz 

drive frequency 


8.14 

GHz 

qubit anharmonicity 

CXl , CX2 

-300 

MHz 

qubit-cavity coupling 

91:92 

70 

MHz 


Table 2 Optimization success for different optimization schemes. For each scheme, we give the gate 
duration T, the total number of propagations, the concurrence error sc = ^ — C by which the gate 
differs from a perfect entangler, the loss of population Spop from the logical subspace, and the gate 
error 6avg = I — -Favg with respect to a geometric phase gate. The number of propagations for 
‘‘pre-opt. s.m.” and “pre-opt. geo.” include both the 116 propagations of the first stage simplex 
optimization and the propagations from the second-stage optimization using Krotov’s method, with 
201 respectively 92 iterations, and two propagations per iteration. The reported number of 
propagations is thus proportional to the total CPU and wall-clock time required to obtain the result 
starting from the original guess pulse. 
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simplex 

185 
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1.95 X 10-5 

1.40 X 10-2 

1.40 X 10-2 

pre-opt. s.m. 

185 
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